COMMENTS ON


"THE  MEASUREMENT  OF  LINEAR  DEPENDENCE  AND 
FEEDBACK  BETWEEN  MULTIPLE  TIME  SERIES" 

BY  JOHN  GEWEKE 

by  EMANUEL  PARZEN 

Institute  of  Statistics 
Texas  A&M  University 

I  would  like  to  congratulate  Professor  Geweke  on  an 
interesting  paper.  I  believe  its  most  valuable  contribution 
is  to  stimulate  us  to  develop  improved  methods  for  modeling 
multiple  time  series  with  the  aim  of  determining  which  variables 
are  significantly  related. 

The problem of modeling multiple time series is one on
whose theory I have written extensively in Parzen (1967), (1967a),
(1969), (1977), and Parzen and Newton (1980). I would like to
show how results and notation from these papers help us to
derive and clarify the results presented by Geweke.

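Let $X(t)$ and $Y(t)$ be multiple time series, with zero means,
jointly normal, and jointly covariance stationary. To study
the relations between $X(\cdot)$ and $Y(\cdot)$, one models the joint
time series obtained by stacking them:

$$Z(t) = \begin{bmatrix} X(t) \\ Y(t) \end{bmatrix}.$$

The covariance matrix $R(v) = E[Z'(t)\, Z(t+v)]$ is assumed
to be summable so that the spectral density matrix

$$f(w) = \sum_{v=-\infty}^{\infty} e^{-2\pi i v w} R(v), \qquad 0 \le w \le 1,$$

exists. Then $R(v) = \int_0^1 e^{2\pi i v w} f(w)\, dw$.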
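The transform pair between $R(v)$ and $f(w)$ can be checked numerically. The following is only a minimal sketch under stated assumptions: it uses a scalar series with the covariances of a first order moving average (the coefficient `theta` and the helper names `spectral_density` and `covariance_from_density` are hypothetical, not taken from the papers cited), and it approximates the integral over $[0, 1]$ by an average over a uniform grid.

```python
import numpy as np

def spectral_density(R, w):
    """f(w) = sum over v of exp(-2*pi*i*v*w) R(v), for a finite set of lags.

    R maps each integer lag v to the covariance R(v) (scalar or square array).
    """
    return sum(np.exp(-2j * np.pi * v * w) * np.asarray(Rv, dtype=complex)
               for v, Rv in R.items())

def covariance_from_density(f, v, n_grid=64):
    """R(v) = integral over [0, 1] of exp(2*pi*i*v*w) f(w) dw.

    The integral is approximated by an average over a uniform grid, which is
    exact here because f is a trigonometric polynomial of low degree.
    """
    grid = np.arange(n_grid) / n_grid
    return sum(np.exp(2j * np.pi * v * w) * f(w) for w in grid) / n_grid

theta = 0.5                                   # hypothetical MA(1) coefficient
R = {-1: theta, 0: 1 + theta**2, 1: theta}    # covariances at lags -1, 0, 1
f = lambda w: spectral_density(R, w)

print(f(0.0).real)                            # spectral density at w = 0
print(covariance_from_density(f, 1).real)     # recovers R(1)
```

For matrix-valued series the same functions apply if each `R[v]` is a square array.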
The joint covariance and spectral density matrices of $X$ and $Y$
are described by the blocks in the partitioned matrices

$$R(v) = \begin{bmatrix} R_{XX}(v) & R_{XY}(v) \\ R_{YX}(v) & R_{YY}(v) \end{bmatrix}, \qquad
f(w) = \begin{bmatrix} f_{XX}(w) & f_{XY}(w) \\ f_{YX}(w) & f_{YY}(w) \end{bmatrix}.$$


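Autoregressive analysis models $Z(t)$ by a joint infinite order
autoregressive scheme:

$$\begin{bmatrix} X(t) \\ Y(t) \end{bmatrix}
= A_{X,Y}(1) \begin{bmatrix} X(t-1) \\ Y(t-1) \end{bmatrix} + \cdots
+ A_{X,Y}(m) \begin{bmatrix} X(t-m) \\ Y(t-m) \end{bmatrix} + \cdots + \eta(t),$$

$$A_{X,Y}(j) = \begin{bmatrix} A_{XX}(j) & A_{XY}(j) \\ A_{YX}(j) & A_{YY}(j) \end{bmatrix}, \qquad
\eta(t) = \begin{bmatrix} \eta_X(t) \\ \eta_Y(t) \end{bmatrix}.$$

We call $\eta(t)$ the joint innovations, and

$$\Sigma = \begin{bmatrix} \Sigma_{XX} & \Sigma_{XY} \\ \Sigma_{YX} & \Sigma_{YY} \end{bmatrix} = E[\eta'(t)\, \eta(t)]$$

the joint innovation covariance matrix. A preferred notation for $\Sigma$ is $\Sigma(X, Y \mid X^-, Y^-)$.
We can define $\eta(t)$ as infinite memory prediction errors:

$$\eta(t) = \begin{bmatrix} X(t) \\ Y(t) \end{bmatrix}
- E\left[ \begin{bmatrix} X(t) \\ Y(t) \end{bmatrix} \,\middle|\, X(t-1), Y(t-1), \ldots, X(t-m), Y(t-m), \ldots \right].$$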


The joint innovations should be contrasted with the
individual innovations

$$\tilde X(t) = X(t) - E[X(t) \mid X(t-1), \ldots, X(t-m), \ldots],$$

$$\tilde Y(t) = Y(t) - E[Y(t) \mid Y(t-1), \ldots, Y(t-m), \ldots],$$

which provide individual infinite order autoregressive models

$$X(t) = A_{X|X}(1)\, X(t-1) + \cdots + A_{X|X}(m)\, X(t-m) + \cdots + \tilde X(t),$$

$$Y(t) = A_{Y|Y}(1)\, Y(t-1) + \cdots + A_{Y|Y}(m)\, Y(t-m) + \cdots + \tilde Y(t).$$

The individual innovation covariance matrices are denoted

$$\Sigma_X = \Sigma(X \mid X^-) = E[\{\tilde X(t)\}'\, \tilde X(t)], \qquad
\Sigma_Y = \Sigma(Y \mid Y^-) = E[\{\tilde Y(t)\}'\, \tilde Y(t)].$$

The innovation innovations are defined to be the joint
innovations of the joint time series $\begin{bmatrix} \tilde X(t) \\ \tilde Y(t) \end{bmatrix}$ of individual
innovations. A remarkable theorem is that the innovation
innovations are identical with the joint innovations. Thus in
practice, one can determine the joint innovations of $X(t)$ and
$Y(t)$ by first "prewhitening" them to form $\tilde X(t)$ and $\tilde Y(t)$, whose
joint innovations are then determined.


The general theory of multiple time series discussed below
is phrased in terms of general stationary time series X(t) and
Y(t). However, it works best in practice when applied to time
series which have been somewhat prewhitened.

In  several  papers  on  model  identification,  Parzen  shows 
that  the  individual  innovations  of  a  time  series  are  essentially 
unique,  while  the  whitening  filter  which  generates  them  may  be 
expressed  in  diverse  ways  as  a  series  of  filters  representing 
detrending, deseasonalizing, and innovations operations.

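To model $Y(t)$ we compare the properties of the prediction
errors, and prediction error covariance matrices, corresponding
to five sets of explanatory variables:

$$\begin{array}{ll}
\text{Prediction error} & \text{Covariance matrix} \\[4pt]
(Y^\nu \mid Y^-)(t) = Y(t) - E[Y(t) \mid Y(t-1), \ldots, Y(t-m), \ldots] & \Sigma(Y \mid Y^-) \\[2pt]
(Y^\nu \mid X^-, Y^-)(t) = Y(t) - E[Y(t) \mid X(t-1), \ldots, X(t-m), \ldots,\ Y(t-1), \ldots, Y(t-m), \ldots] & \Sigma(Y \mid X^-, Y^-) \\[2pt]
(Y^\nu \mid X^+, Y^-)(t) = Y(t) - E[Y(t) \mid X(t), X(t-1), \ldots, X(t-m), \ldots,\ Y(t-1), \ldots, Y(t-m), \ldots] & \Sigma(Y \mid X^+, Y^-) \\[2pt]
(Y^\nu \mid X)(t) = Y(t) - E[Y(t) \mid X(s),\ -\infty < s < \infty] & \Sigma(Y \mid X) \\[2pt]
(Y^\nu \mid X, Y^-)(t) = Y(t) - E[Y(t) \mid X(s),\ -\infty < s < \infty,\ Y(t-1), Y(t-2), \ldots, Y(t-m), \ldots] & \Sigma(Y \mid X, Y^-)
\end{array}$$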


It should be noted that after a conditioning sign $\mid$, $X^-$
represents the past of $X$, $X^+$ the past and present of $X$, and $X$
the past, present, and future of $X$.




The spectral densities of the various error series are
denoted $f_{Y^\nu|Y^-}(w)$, $f_{Y^\nu|X^-,Y^-}(w)$, $f_{Y^\nu|X^+,Y^-}(w)$, $f_{Y^\nu|X}(w)$,
$f_{Y^\nu|X,Y^-}(w)$.


(1) $Y \mid Y^-$: $\Sigma(Y \mid Y^-)$ is the individual innovation covariance
matrix $\Sigma_Y$; $(Y^\nu \mid Y^-)(t)$ is white noise.

(2) $Y \mid X^-, Y^-$: $\Sigma(Y \mid X^-, Y^-)$ is the block $\Sigma_{YY}$ in the joint
innovation covariance matrix; $(Y^\nu \mid X^-, Y^-)(t)$ is white noise.

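(3) $Y \mid X^+, Y^-$: Parzen (1967) shows (p. 401)

$$(Y^\nu \mid X^+, Y^-)(t) = (Y^\nu \mid X^-, Y^-)(t) - \Sigma_{YX}\, \Sigma_{XX}^{-1}\, (X^\nu \mid X^-, Y^-)(t);$$

$$\Sigma(Y \mid X^+, Y^-) = \Sigma_{YY} - \Sigma_{YX}\, \Sigma_{XX}^{-1}\, \Sigma_{XY}.$$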
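The formula $\Sigma_{YY} - \Sigma_{YX} \Sigma_{XX}^{-1} \Sigma_{XY}$ in item (3) is a Schur complement of the joint innovation covariance matrix. The sketch below, which assumes a purely hypothetical numerical value for the joint innovation covariance and a hypothetical helper name `conditional_innovation_cov`, checks that it equals the covariance of $\eta_Y - \Sigma_{YX} \Sigma_{XX}^{-1} \eta_X$ and that the determinant of the joint matrix factors accordingly.

```python
import numpy as np

def conditional_innovation_cov(Sigma, p):
    """Return Sigma_YY - Sigma_YX Sigma_XX^{-1} Sigma_XY.

    Sigma is the joint innovation covariance; X is assumed to occupy the
    first p coordinates and Y the remaining ones.
    """
    Sxx, Sxy = Sigma[:p, :p], Sigma[:p, p:]
    Syx, Syy = Sigma[p:, :p], Sigma[p:, p:]
    return Syy - Syx @ np.linalg.solve(Sxx, Sxy)

# Hypothetical joint innovation covariance: X has 2 components, Y has 1.
Sigma = np.array([[2.0, 0.6, 0.3],
                  [0.6, 1.0, 0.2],
                  [0.3, 0.2, 1.5]])
p = 2
S_cond = conditional_innovation_cov(Sigma, p)

# Direct computation: covariance of eta_Y - Sigma_YX Sigma_XX^{-1} eta_X.
q = Sigma.shape[0] - p
T = np.hstack([-Sigma[p:, :p] @ np.linalg.inv(Sigma[:p, :p]), np.eye(q)])
S_direct = T @ Sigma @ T.T

# Determinant factorization: det Sigma = det Sigma_XX * det S_cond.
det_joint = np.linalg.det(Sigma)
det_product = np.linalg.det(Sigma[:p, :p]) * np.linalg.det(S_cond)
print(S_cond, det_joint, det_product)
```

The determinant factorization is the finite-dimensional fact behind writing the instantaneous feedback measure in terms of blocks of the joint innovation covariance.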

(4) $Y \mid X$: Parzen (1967) shows that a joint autoregressive
model for $X(t)$ and $Y(t)$ provides $f(w)$, from which one can
compute the statistical parameters of the representation

$$Y(t) = (Y^\mu \mid X)(t) + (Y^\nu \mid X)(t),$$

$$(Y^\mu \mid X)(t) = B(L)\, X(t), \qquad B(z) = \sum_{k=-\infty}^{\infty} B(k)\, z^k,$$

by

$$B(e^{2\pi i w}) = f_{YX}(w)\, f_{XX}^{-1}(w),$$

$$f_{Y^\nu|X}(w) = f_{YY}(w) - f_{YX}(w)\, f_{XX}^{-1}(w)\, f_{XY}(w),$$

$$\Sigma(Y \mid X) = \int_0^1 f_{Y^\nu|X}(w)\, dw.$$

In practice, to estimate the time domain coefficients $B(k)$
from an estimator $\hat B(e^{2\pi i w})$ one uses regression methods.
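The frequency domain formulas of item (4) can be illustrated on a model whose answer is known in closed form. The sketch below assumes a hypothetical scalar model $Y(t) = b X(t-1) + e(t)$ with $X$ and $e$ independent white noise; the parameter values and function names are illustrative only. The spectral blocks are written in the convention $R(v) = E[Z'(t) Z(t+v)]$, $f(w) = \sum_v e^{-2\pi i v w} R(v)$, so that $R_{YX}(-1) = b\,\sigma_X^2$.

```python
import numpy as np

# Hypothetical model: Y(t) = b X(t-1) + e(t), X and e independent white noise.
b, sx2, se2 = 0.8, 1.0, 0.5

def f_blocks(w):
    """Blocks of the joint spectral density at frequency w (scalar case)."""
    fxx = sx2
    fyx = b * sx2 * np.exp(2j * np.pi * w)   # from R_YX(-1) = b * sx2
    fxy = np.conj(fyx)
    fyy = b**2 * sx2 + se2
    return fxx, fxy, fyx, fyy

def transfer_and_residual(w):
    """Return B(e^{2 pi i w}) and the residual spectral density f_{Y^nu|X}(w)."""
    fxx, fxy, fyx, fyy = f_blocks(w)
    B = fyx / fxx
    f_res = (fyy - fyx * fxy / fxx).real
    return B, f_res

# Sigma(Y|X) is the integral of the residual spectral density over [0, 1],
# approximated here by an average over a uniform grid.
grid = np.arange(256) / 256
f_res = np.array([transfer_and_residual(w)[1] for w in grid])
Sigma_Y_given_X = f_res.mean()
B_quarter = transfer_and_residual(0.25)[0]
print(Sigma_Y_given_X, B_quarter)
```

In this example $B(z) = b z$, so the only nonzero time domain coefficient is $B(1) = b$, and $\Sigma(Y \mid X)$ reduces to the variance of $e(t)$.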


(5) $Y \mid X, Y^-$: As Geweke shows, find an autoregressive model
for $(Y^\nu \mid X)(t)$:

$$g_{Y^\nu|X}(L)\, (Y^\nu \mid X)(t) = \varepsilon(t).$$

It generates a white noise sequence $\varepsilon(t)$ which can be identified
with $(Y^\nu \mid X, Y^-)(t)$. Further, a model for $Y(t)$ as a function of
$X(s)$, $-\infty < s < \infty$, and $Y(s)$, $s \le t-1$, is given by

$$g_{Y^\nu|X}(L)\, Y(t) = g_{Y^\nu|X}(L)\, B(L)\, X(t) + \varepsilon(t);$$

$$\Sigma(Y \mid X, Y^-) = \Sigma_\varepsilon = \text{covariance matrix of } \varepsilon(t).$$

The time domain coefficients of the filter with input $X(t)$
have Fourier transform

$$g_{Y^\nu|X}(e^{2\pi i w})\, B(e^{2\pi i w}).$$

If one needs only the log determinant of $\Sigma_\varepsilon$, it can be calculated
without fitting an autoregressive scheme:

$$\log |\Sigma_\varepsilon| = \int_0^1 \log |f_{Y^\nu|X}(w)|\, dw.$$

One need not actually calculate the spectral density since
Geweke's Theorem 1 shows that

$$\ln |\Sigma(Y \mid X, Y^-)| = \ln |\Sigma(X, Y \mid X^-, Y^-)| - \ln |\Sigma(X \mid X^-)|.$$
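The statement that the log determinant of an innovation covariance equals the integral of the log determinant of the spectral density can be checked on a case with a known answer. The sketch below is only an illustration: it uses a hypothetical scalar first order autoregression with coefficient `a` and innovation variance `s2` as a stand-in for the error series, with spectral density $s2 / |1 - a e^{2\pi i w}|^2$, and approximates the integral by a grid average.

```python
import numpy as np

a, s2 = 0.6, 2.0   # hypothetical AR(1) coefficient and innovation variance

def f(w):
    """Spectral density of the scalar AR(1) series Y(t) = a Y(t-1) + e(t)."""
    return s2 / np.abs(1 - a * np.exp(2j * np.pi * w)) ** 2

# Integral over [0, 1] of log f(w), approximated by a uniform grid average.
grid = np.arange(512) / 512
log_innovation_variance = np.mean(np.log(f(grid)))

print(np.exp(log_innovation_variance))   # close to s2
```

No autoregressive scheme is fitted here; the innovation variance is recovered from the spectral density alone.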

The  meaning  of  the  various  definitions  of  feedback,  and  the 
formulas  for  them  given  by  Geweke's  Theorem  1,  is  easily 
understood  if  one  employs  the  notation  we  have  introduced. 


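Measure of linear dependence (or information): $F_{X,Y}$

$$\begin{aligned}
F_{X,Y} &= \ln\det \Sigma(X \mid X^-) - \ln\det \Sigma(X \mid X^-, Y) \\
&= \ln\det \Sigma(Y \mid Y^-) - \ln\det \Sigma(Y \mid Y^-, X)
\end{aligned}$$

Measure of instantaneous linear feedback: $F_{X \cdot Y}$

$$\begin{aligned}
F_{X \cdot Y} &= \ln\det \Sigma(X \mid X^-, Y^-) - \ln\det \Sigma(X \mid X^-, Y^+) \\
&= \ln\det \Sigma(Y \mid X^-, Y^-) - \ln\det \Sigma(Y \mid X^+, Y^-)
\end{aligned}$$

Measure of linear feedback from $Y$ to $X$: $F_{Y \to X}$

$$\begin{aligned}
F_{Y \to X} &= \ln\det \Sigma(X \mid X^-) - \ln\det \Sigma(X \mid X^-, Y^-) \\
&= \ln\det \Sigma(Y \mid X^+, Y^-) - \ln\det \Sigma(Y \mid X, Y^-)
\end{aligned}$$

Measure of linear feedback from $X$ to $Y$: $F_{X \to Y}$

$$\begin{aligned}
F_{X \to Y} &= \ln\det \Sigma(Y \mid Y^-) - \ln\det \Sigma(Y \mid Y^-, X^-) \\
&= \ln\det \Sigma(X \mid X^-, Y^+) - \ln\det \Sigma(X \mid X^-, Y)
\end{aligned}$$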

Theorem 1 in Geweke's paper shows that there is a crucial
identity from which the equivalence of the foregoing definitions
follows immediately:

$$\begin{aligned}
\ln\det \Sigma(X \mid X^-, Y^+) + \ln\det \Sigma(Y \mid X^-, Y^-)
&= \ln\det \Sigma(X \mid X^-, Y) + \ln\det \Sigma(Y \mid Y^-) \\
&= \ln\det \Sigma(X, Y \mid X^-, Y^-).
\end{aligned}$$

These  feedback  measures  seem  to  me  to  be  most  clearly 
interpreted  as  measuring  the  significance  of  various  variables 
as  independent  variables  in  a  model  for  Y(t). 
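$$\begin{array}{lllll}
\text{Measure} & \text{Variables in model} & \text{In words} & \text{Variables tested for inclusion} & \text{Add} \\[4pt]
F_{X \to Y} & Y^- & \text{Past } Y & Y^-, X^- & \text{Past } X \\
F_{X \cdot Y} & Y^-, X^- & \text{Past } Y,\ \text{past } X & Y^-, X^+ & \text{Present } X \\
F_{Y \to X} & Y^-, X^+ & \text{Past } Y,\ \text{past } X,\ \text{present } X & Y^-, X & \text{Future } X \\
F_{X,Y} & Y^- & \text{Past } Y & Y^-, X & \text{All } X
\end{array}$$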

This table also exhibits the information decomposition of the
identity:

$$F_{X,Y} = F_{X \to Y} + F_{X \cdot Y} + F_{Y \to X}.$$

One uses $F_{X,Y}$ to compare the hypotheses $H_0$: model $Y$ by $Y^-$;
$H_1$: model $Y$ by $Y^-, X$. In addition one should compute $\Sigma(Y \mid X)$
in order to compare the hypotheses $H_0$: model $Y$ by $X$; $H_1$: model
$Y$ by $X, Y^-$.

In any empirical multiple time series analysis, one should
compute, and report, $\Sigma_X$, $\Sigma_Y$, $\Sigma$ (the individual and joint innovation
covariances). Then one should compute (and test for significant
difference from zero)

$$\begin{aligned}
F_{X \to Y} &= \ln\det \Sigma_Y - \ln\det \Sigma_{YY}, \\
F_{X \cdot Y} &= -\ln\det \left( I - \Sigma_{YY}^{-1}\, \Sigma_{YX}\, \Sigma_{XX}^{-1}\, \Sigma_{XY} \right), \\
F_{X,Y} &= \ln\det \Sigma_X + \ln\det \Sigma_Y - \ln\det \Sigma, \\
F_{Y \to X} &= \ln\det \Sigma_X - \ln\det \Sigma_{XX}.
\end{aligned}$$

The computation of these determinants, and additional
insight into the relations between variables, could be attained
by computing the eigenvalues and eigenvectors of the matrices,
such as $\Sigma^{-1}(Y \mid Y^-, X^-)\, \Sigma(Y \mid Y^-, X^+)$, whose log determinants are
being calculated. The eigenvalues can be interpreted in terms
of various canonical correlations (see Parzen and Newton (1979)).
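As an illustration of the computations recommended here, the sketch below evaluates the four feedback measures from $\Sigma_X$, $\Sigma_Y$, and $\Sigma$, together with the eigenvalues of $\Sigma_{YY}^{-1} \Sigma_{YX} \Sigma_{XX}^{-1} \Sigma_{XY}$, which are squared canonical correlations between the joint innovations of $X$ and $Y$. The numerical inputs, the function name `feedback_measures`, and the dictionary keys are hypothetical; testing for significance is not attempted.

```python
import numpy as np

def logdet(A):
    """Natural log of the determinant of a positive definite matrix."""
    return np.linalg.slogdet(np.atleast_2d(A))[1]

def feedback_measures(Sigma_X, Sigma_Y, Sigma, p):
    """Feedback measures from individual and joint innovation covariances.

    X is assumed to occupy the first p coordinates of the joint series.
    """
    Sxx, Sxy = Sigma[:p, :p], Sigma[:p, p:]
    Syx, Syy = Sigma[p:, :p], Sigma[p:, p:]
    M = np.linalg.solve(Syy, Syx) @ np.linalg.solve(Sxx, Sxy)
    rho2 = np.sort(np.linalg.eigvals(M).real)[::-1]   # squared canonical correlations
    return {
        "F_x_to_y": logdet(Sigma_Y) - logdet(Syy),
        "F_inst": -logdet(np.eye(M.shape[0]) - M),
        "F_y_to_x": logdet(Sigma_X) - logdet(Sxx),
        "F_total": logdet(Sigma_X) + logdet(Sigma_Y) - logdet(Sigma),
        "rho2": rho2,
    }

# Hypothetical inputs for two scalar series.
Sigma_X = np.array([[1.2]])
Sigma_Y = np.array([[2.0]])
Sigma = np.array([[1.0, 0.3],
                  [0.3, 1.5]])
m = feedback_measures(Sigma_X, Sigma_Y, Sigma, p=1)
print(m)
```

With these definitions the total measure equals the sum of the three components by construction, and the instantaneous measure equals minus the sum of $\ln(1 - \rho_i^2)$ over the squared canonical correlations $\rho_i^2$.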

Finally, to estimate (from data) parameters such as $\Sigma$,
$\Sigma_X$, and $\Sigma_Y$, it is strongly recommended that one use approximating
autoregressive schemes, and order-determining criteria such as
CAT (see Parzen (1974), (1977)).
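A minimal sketch of estimation by approximating autoregressive schemes follows. It makes several assumptions that are not part of the discussion above: the autoregressions are fitted by ordinary least squares, the order is fixed at a hypothetical value rather than chosen by CAT or any other criterion, and the data are simulated from a hypothetical model $Y(t) = 0.8\, X(t-1) + e(t)$ in which feedback runs from $X$ to $Y$ only.

```python
import numpy as np

def innovation_cov(Z, order):
    """Least-squares fit of an autoregression of the given order to the rows
    of Z (shape: time x variables); returns the residual covariance matrix."""
    T = Z.shape[0]
    target = Z[order:]
    # Column blocks hold Z(t-1), ..., Z(t-order), aligned with the rows of target.
    lags = np.hstack([Z[order - j:T - j] for j in range(1, order + 1)])
    coef, *_ = np.linalg.lstsq(lags, target, rcond=None)
    resid = target - lags @ coef
    return resid.T @ resid / resid.shape[0]

# Simulated zero-mean data from the hypothetical model.
rng = np.random.default_rng(0)
T = 4000
x = rng.standard_normal(T)
e = rng.standard_normal(T)
y = np.empty(T)
y[0] = e[0]
y[1:] = 0.8 * x[:-1] + e[1:]

order = 2   # fixed, hypothetical order; no order-determining criterion is used
Sigma_X = innovation_cov(x[:, None], order)
Sigma_Y = innovation_cov(y[:, None], order)
Sigma = innovation_cov(np.column_stack([x, y]), order)

logdet = lambda A: np.linalg.slogdet(A)[1]
F_x_to_y = logdet(Sigma_Y) - logdet(Sigma[1:, 1:])
F_y_to_x = logdet(Sigma_X) - logdet(Sigma[:1, :1])
print(F_x_to_y, F_y_to_x)
```

Because the individual and joint regressions use the same rows and nested sets of regressors, both estimated measures are nonnegative up to rounding; in this simulation the estimate of feedback from $X$ to $Y$ should be clearly positive and the estimate in the other direction close to zero.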

As  with  any  excellent  piece  of  research,  Geweke’s  paper 
raises  many  open  questions,  some  of  which  have  been  alluded  to 
in  my  discussion. 

The definition of the feedback spectral measure $f_{Y \to X}(w)$
given by Geweke is impressive. Only experience can show us
whether it should be routinely computed in empirical research;
it may suffice to use as the feedback spectral measure

$$f_{X,Y}(w) = -\ln\det \left( I - f_{YY}^{-1}(w)\, f_{YX}(w)\, f_{XX}^{-1}(w)\, f_{XY}(w) \right)$$

computed using canonical spectral analysis (see Brillinger
(1981), chapter 10 for references to this literature).